WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

C12N 15/12, C07K 14/705, C12N 5/10, 
15/57, 9/48, 9/14, 15/55 



A3 



(11) International Publication Number: 
(43) International Publication Date: 



WO 98/21328 

22 May 1998 (22.05.98) 



(21) Internationa] Application Number: PCT/JP97/04056 

(22) International Filing Date: 7 November 1997 (07.1 1.97) 



(30) Priority Data: 

8/301429 



13 November 1996 (13.11.96) JP 



(71) Applicants (for all designated States except US): SAGAMI 

CHEMICAL RESEARCH CENTER [JP/JP]; 4-1, 
Nishi-Ohnuma 4-chome, Sagamihara-shi, Kanagawa 229 
(JP). PROTEGENE TNC. [JP/JP]; 2-20-3, Naka^cho, 
Meguro-ku, Tokyo 153 (JP). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): KATO, Seishi [JP/JP]; 
3-46-50, Wakamatsu, Sagamihara-shi, Kanagawa 229 (JP). 
SEKINE, Shingo [JP/JP]; 4-4-1, Nishi-Ohnuma, Sagami- 
hara-shi, Kanagawa 229 (JP). YAMAGUCHI, Tomoko 
[JP/JP]; 5-13-11, Takasago, Katsushika-ku, Tokyo 125 
(JP). KOBAYASHI, Midori [JP/JP]; 647-2, Chougo, Fu- 
jisawa-shi, Kanagawa 252 (JP). 

(74) Agents: AOYAMA, Tamotsu et al.; Aoyama & Partners, 
IMP Building, 3-7, Shiromi 1-chome, Chuo-ku, Osaka-shi, 
Osaka 540 (JP). 



(81) Designated States: AU, CA, JP, MX, US, European patent 
(AT, BE, CH, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, 
MC, NL, PT, SE). 



Published 

With international search report 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 

(88) Date of publication of the international search report: 

20 August 1998 (20.08,98) 



(54) Title: HUMAN PROTEINS HAVING TRANSMEMBRANE DOMAINS AND DNAS ENCODING THESE PROTEINS 



(57) Abstract 

Proteins containing any of the amino acid sequences represented by Sequence No. 1 to Sequence No. 2 or by Sequence No. 4 to 
Sequence No. 25 and DNAs encoding said proteins exemplified by cDNAs containing any of the base sequences represented by Sequence 
No. 26 to Sequence No. 50. Said proteins can be provided by expressing cDNAs encoding human proteins having transmembrane domains 
and recombinants of these human cDNAs. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


zw 


Zimbabwe 


CI 


Cote d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






CZ 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







INTERNATIONAL SEARCH REPORT 



Interna al Application No 

PCT/JP 97/04056 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 6 C12N15/12 C07K14/705 C12N5/10 
C12N9/14 C12N15/55 



C12N15/57 C12N9/48 



Aooording to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by 

IPC 6 C12N C07K 



classification symbols) 



Documentation searched other than minimum documentation to the extent that such documents are included in th e fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 0 



Citation of document with indication, where appropriate, of the relevant passages 



Relevant to olaim No. 



JOURNAL OF MOLECULAR BIOLOGY, 

vol. 157, no. 1, 5 May 1982, 

pages 105-132, XP000609692 

KYTE J ET AL: M A SIMPLE METHOD FOR 

DISPLAYING THE HYDROPATHIC CHARACTER OF A 

PROTEIN" 

cited in the application 
see abstract 

SCIENCE, 

vol. 272, 10 May 1996, 

pages 872-877, XP002031517 

FENG Y ET AL: "HIV-1 ENTER C0FACT0R: 

FUNCTIONAL CDNA CLONING OF A 

SEVEN-TRANSMEMBRANE G PROTEIN-COUPLED 

RECEPTOR" 

cited in the application 
see the whole document 

-/-- 



1-5 



1-5 



| X] F w*w doouments ant listed in the continuation of box C. 



^| Patent family members are listed in 



" Special categories of cited documents ; 

A" document defining the general state of the art which it not 
considered to be of particular relevance 

E* earlier document but published on or after the International 
filing date 

•L* document whiort may throw doubts on priority claim(s) or 
which is oiled to establish the publication date of another 
citation or other special reason (as speoified) 

O" document referring to an oral disclosure, use, exhibition or 
other means 

P' document published prior to the international filing date but 
later than the priority date claimed 



*T* later document published after the international filing date 
or priority date and not in conflict with the application but 
orted to understand the principle or theory underlying the 
invention 

'X* document of particular relevance; the claimed invention 
cannot be considered novel or oannot be considered to 
involve an inventive step when the document is taken alone 

•V dooument of particular relevanoe; the ofaimed invention 

oannot be considered to involve an inventive step when the 
document is combined with one or more other such doou- 
menta, such combination being obvious to a person skilled 
in the art 

*4" document member of the same patent family 



Date of the actual completion of the international search 



12 March 1998 



Name and mailing address of the ISA 

European Patent Offioe, P.B. 581 6 Patentiaan 2 
NL ■ 2280 HV Rijswijk 
Tel. (+31-70) 34O-2O40, Tx. 31 651 epo nl, 
Fax: (+31-70) 340-3016 



Form PCT/ISA/21Q {< 



Date of mailing of the international search report 



0 3. 07. 98 



Authorixsd officer 



ESPEN, J 



page 1 of 2 



INTERNATIONAL SEARCH REPORT 



Interm al Application No 

PCT/JP 97/04056 



C,(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category fl Citation of document, with indication, where appropriate, of the relevant 



Relevant to olaim No. 



p.x 



J BIOL CHEM, APR 12 1996, 271 (15) 
P8549-52, UNITED STATES, XP002O5879O 
HOLLOWAY MP ET AL: "A hydrophobic domain 
of Ca2+-modulating cyclophilin ligand 
modulates calcium influx signaling in T 
lymphocytes." 
see abstract 

BIOCHEMICAL AND BIOPHYSICAL RESEARCH 
COMMUNICATIONS., 

vol. 168, 1990, ORLANDO, FL US, 
pages 574-579, XP002058791 
APPERSON M ET AL: "A yeast protein, 
homologous to the proteolipid of the 
chromaffin granule proton -ATPase, is 
important for cell growth" 
see figure 2 

EMHUM1 Database entry HSD052 
Accession number D89052; 07 Dec 1996 
NISHIGORI H ET AL: 'Cloning and 
chromosomal localization of the gene 
encoding a protein homologous to the yeast 
protein PPA1, an proton-ATPase-like 
protein' 
XP002058792 
see sequence 



1-5 



1-5 



1-5 



1 



Forni PCT/ISA/210 (continuation of looond ihoot) (July 1992) 



page 2 of 



2 



Int itionai application No. 



INTERNATIONAL SEARCH REPORT 



PCT/JP 97/04056 



Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This International Search Report has not been established in respect of certain claims under Article 1 7(2)(a) for the following reasons: 



because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 



because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 
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SEQ ID No 1 

2. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID No 27 and 52 and protein relating to 
SEQ ID No 2 

3. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID No 28 and 53 and protein relating to 
SEQ ID No 3 

4. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID No 29 and 54 and protein relating to 
SEQ ID No 4 

5. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID No 30 and 55 and protein relating to 
SEQ ID No 5 

6. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID No 31 and 56 and protein relating to 
SEQ ID No 6 

7. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID No 32 and 57 and protein relating to 
SEQ ID No 7 

8. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID No 33 and 58 and protein relating to 
SEQ ID No 8 

9. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID No 34 and 59 and protein relating to 
SEQ ID No 9 

10. Claims: Claims 1-5 in part 
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DNAs relating to SEQ ID No 35 and 60 and protein relating to 
SEQ ID NO 10 

11. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID No 36 and 61 and protein relating to 
SEQ ID No 11 

12. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID No 37 and 62 and protein relating to 
SEQ ID No 12 

13. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID No 38 and 63 and protein relating to 
SEQ ID No 13 

14. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID No 39 and 64 and protein relating to 
SEQ ID No 14 

16. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID No 41 and 66 and protein relating to 
SEQ ID No 16 

17. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID No 42 and 57 and protein relating to 
SEQ ID No 17 

18. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID No 43 and 68 and protein relating to 
SEQ ID No 18 

19. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID No 44 and 69 and protein relating to 
SEQ ID No 19 

20. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID '.No 45 and 70 and protein relating to 
SEQ ID No 20 
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21. Claims: Claims 1-5 in part 

ONAs relating to SEQ ID No 46 and 71 and protein relating to 
5EQ ID No 21 

22. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID No 47 and 72 and protein relating to 
SEQ ID No 22 

23. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID No 48 and 73 and protein relating to 
SEQ ID No 23 

24. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID No 49 and 74 and protein relating to 
SEQ ID No 24 

25. Claims: Claims 1-5 in part 

DNAs relating to SEQ ID No 50 and 75 and protein relating to 
SEQ ID No 25 
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TCATGCACCC TGTACAGTTG CCACGTTACT GCCTTTTTTA AAAATATATT TGACAGAAAC 1690 
CAGGTGCCTT CAGAGGCTCT CTGATTTAAA T 1721 



Sequence No, : 68 

Sequence length: 1504 

Sequence type: Nucleic acid 

Strandedness : Double 

Topology: Linear 

Sequence kind: cDNA to mRNA 

Original source: 

Organism species : Eomo sapiens 

Cell kind: Stomach cancer 

Clone name: HP10297 
Sequence characteristics 

Code representing characteristics: CDS 

Existence site: 63.. 614 

Characterization method: E 
Sequence description 

CTTTTGCGGC TGCAGCGGGC TTGTAGGTGT CCGGCTTTGC TGGCCCAGCA AGCCTGATAA 60 
GC ATG AAG CTC TTA TCT TTG GTG GCT GTG GTC GGG TGT TTG CTG GTG 107 
Met Lys Leu Leu Ser Leu Val Ala Val Val Gly Cys Leu Leu Val 
1 5 10 15 

CCC CCA GCT GAA GCC AAC AAG AGT TCT GAA GAT ATC CGG TGC AAA TGC 155 
Pro Pro Ala Glu Ala Asn Lys Ser Ser Glu Asp He Arg Cys Lys Cys 

20 25 30 

ATC TGT CCA CCT TAT AGA AAC ATC AGT GGG CAC ATT TAC AAC CAG AAT 203 
He Cys Pro Pro Tyr Arg Asn He Ser Gly His He Tyr Asn Gin Asn 

35 40 45 

GTA TCC CAG AAG GAC TGC AAC TGC CTG CAC GTG GTG GAG CCC ATG CCA 251 
Val Ser Gin Lys Asp Cys Asn Cys Leu His Val Val Glu Pro Met Pro 

50 55 60 

GTG CCT GGC CAT GAC GTG GAG GCC TAC TGC CTG CTG TGC GAG TGC AGG 299 
Val Pro Gly His Asp Val Glu Ala Tyr Cys Leu Leu Cys Glu Cys Arg 

65 70 75 

TAC GAG GAG CGC AGC ACC ACC ACC ATC AAG GTC ATC ATT GTC ATC TAC 347 
Tyr Glu Glu Arg Ser Thr Thr Thr He Lys Val He He Val He Tyr 
80 85 90 95 

CTG TCC GTG GTG GGT GCC CTG TTG CTC TAC ATG GCC TTC CTG ATG CTG 395 
Leu Ser Val Val Gly Ala Leu Leu Leu Tyr Met Ala Phe Leu Met Leu 

100 105 HO 

GTG GAC CCT CTG ATC CGA AAG CCG GAT GCA TAC ACT GAG CAA CTG CAC 443 
Val Asp Pro Leu He Arg Lys Pro Asp Ala Tyr Thr Glu Gin Leu His 
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115 120 125 

AAT GAG GAG GAG AAT GAG GAT GCT CGC TCT ATG GCA GCA GCT GCT GCA 491 
Asn Glu Glu Glu Asn Glu Asp Ala Arg Ser Met Ala Ala Ala Ala Ala 

130 140 
TCC CTC GGG GGA CCC CGA GCA AAC ACA GTC CTG GAG CGT GTG GAA GGT 539 
Ser Leu Gly Gly Pro Arg Ala Asn Thr Val Leu Glu Arg Val Glu Gly 

145 150 155 

GCC CAG CAG CGG TGG AAG CTG CAG GTG CAG GAG CAG CGG AAG ACA GTC 587 
Ala Gin Gin Arg Trp Lys Leu Gin Val Gin Glu Gin Arg Lys Thr Val 
160 165 170 175 

TTC GAT CGG CAC AAG ATG CTC AGC TAGATGGGCT GGTGTGGTTG GGTCAAGGC 640 
Phe Asp Arg His Lys Met Leu Ser 
180 

CCCAACACCA TGGCTGCCAG CTTCCAGGCT GGACAAAGCA GGGGGCTACT TCTCCCTTCC 700 
CTCGGTTCCA GTCTTCCCTT TAAAAGCCTG TGGCATTTTT CCTCCTTCTC CCTAACTTTA 760 
GAAATGTTGT ACTTGGCTAT TTTGATTAGG GAAGAGGGAT GTGGTCTCTG ATCTCTGTTG 820 
TCTTCTTGGG TCTTTGGGGT TGAAGGGAGG GGGAAGGCAG GCCAGAAGGG AATGGAGACA 880 
TTCGAGGCGG CCTCAGGAGT GGATGCGATC TGTCTCTCCT GGCTCCACTC TTGCCGCCTT 940 
CCAGCTCTGA GTCTTGGGAA TGTTGTTACC CTTGGAAGAT AAAGCTGGGT CTTCAGGAAC 1000 
TCAGTGTCTG GGAGGAAAGC ATGGCCCAGC ATTCAGCATG TGTTCCTTTC TGCAGTGGTT 1060 
CTTATCACCA CCTCCCTCCC AGCCCCAGCG CCTCAGCCCC AGCCCCAGCT CCAGCCCTGA 1120 
GGACAGCTCT GATGGGAGAG CTGGGCCCCC TGAGCCCACT GGGTCTTCAG GGTGCACTGG 1180 
AAGCTGGTGT TCGCTGTCCC CTGTGCACTT CTCGCACTGG GGCATGGAGT GCCCATGCAT 1240 
ACTCTGCTGC CGGTCCCCTC ACCTGCACTT GAGGGGTCTG GGCAGTCCCT CCTCTCCCCA 1300 
GTGTCCACAG TCACTGAGCC AGACGGTCGG TTGGAACATG AGACTCGAGG CTGAGCGTGG 1360 
ATCTGAACAC CACAGCCCCT GTACTTGGGT TGCCTCTTGT CCCTGAACTT CGTTGTACCA 1420 
GTGCATGGAG AGAAAATTTT GTCCTCTTGT CTTAGAGTTG TGTGTAAATC AAGGAAGCCA 1480 
TCATTAAATT GTTTTATTTC TCTC 150A 



Sequence No. : 69 
Sequence length: 532 
Sequence type: Nucleic acid 
Strandedness: Double 
Topology: Linear 
Sequence kind: cDNA to mRNA 
Original source: 

Organism species: Homo sapiens 

Cell kind: Stomach cancer 

Clone name: HP10299 
Sequence characteristics 

Code representing characteristics: CDS 

Existence site: 93.. 443 

Characterization method: E 
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Claims 



1. A protein containing any of the amino acid sequences 
represented by Sequence No. 1 to Sequence No. 2 or by Sequence 
No. 4 to Sequence No. 25. 

2. A DNA encoding any of the proteins as described in 
Claim 1. 

3 . A cDNA containing any of the base sequences represented 
by Sequence No. 26 to Sequence No. 50. 

4. A cDNA as described in Claim 3 which comprises any of 
the base sequences represented by Sequence No. 51 to Sequence No. 
75. 

5. A transformed eukaryotic cell capable of expressing any 
of DNAs as described in Claim 2 to 4 and producing a protein as 
described in Claim 1 . 



PRIOR APPLICATION NUMBER: PCT/ JP97/04 056 
PRIOR FILING DATE: 1997-11-07 
NUMBER OF SEQ ID NOS : 91 
SOFTWARE: Patentln Ver. 2.0 
SEQ ID NO 68 
LENGTH: 1504 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME / KEY : CDS 
LOCATION: (63) . . (611) 
US-09-284-320-68 

Query Match 99.7%; Score 1494.2; DB 17; Length 1504; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 14 96; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 
Qy 1 GCGGCTGCAGCGGGCTTGTAGGTGTCCGGCTTTGCTGGCCCAGCAAGCCTGATAAGC ATG 60 

IMIMIIIIIIIMIIIIIIIIMIIIIIIIMIIIMIIIIIIIIIIIIIIMIMM 

Db 6 GCGGCTGCAGCGGGCTTGTAGGTGTCCGGCTTTGCTGGCCCAGCAAGCCTGATAAGC ATG 65 

Oy 61 AAGCTCTTATCTTTGGTGGCCGTGGTCGGGTGTTTGCTGGTGCCCCCAGCTGAAGCCAAC 12 0 

1 1 1 1 Illlll Mill I Mill MM II I II I Mill II I 

Db 66 AAGCTCTTATCTTTGGTGGCTGTGGTCGGGTGTTTGCTGGTGCCCCCAGCTGAAGCCAAC 12 5 

Qy 121 AAGAGTTCTGAAGATATCCGGTGCAAATGCATCTGTCCACCTTATAGAAACATCAGTGGG 18 0 

M 1 1 II i I M 1 1 1 1! 1 1 i 1 1 M 1 ! 1 1 M 1 1 1 M 1 1 1 1 1 ! 1 1 M 1 1 1 M 1 1 1 M 1 1 1 1 M I 

Db 12 6 AAGAGTTCTGAAGATATCCGGTGCAAATGCATCTGTCCACCTTATAGAAACATCAGTGGG 185 

Qy 181 CATATTTACAACCAGAATGTATCCCAGAAGGACTGCAACTGCCTGCACGTGGTGGAGCCC 24 0 

M 1 1 1 I I I I 

Db 18 6 CACATTTACAACCAGAATGTATCCCAGAAGGACTGCAACTGCCTGCACGTGGTGGAGCCC 24 5 

Oy 241 ATGCCAGTGCCTGGCCATGACGTGGAGGCCTACTGCCTGCTGTGCGAGTGCAGGTACGAG 3 00 

IIIIIMIMIIIIIIIIIIMMIIIIIMIIIIIIIIIIIMIIIIIIMIIIIIIII 

Db 24 6 ATGCCAGTG^CTGGCCATGACGTGGAGGCCTACTGCCTGCTGTGCGAGTGCAGGTACGAG 3 05 

Qy 301 GAGCGCAGCACCACCACCATCAAGGTCATCATTGTCATCTACCTGTCCGTGGTGGGTGCC 3 60 

Db 3 06 GAGCGCAGCACCACCACCATCAAGGTCATCATTGTCATCTACCTGTCCGTGGTGGGTGCC 3 65 

Qy 361 CTGTTGCTCTACATGGCCTTCCTGATGCTGGTGGACCCTCTGATCCGAAAGCCGGATGCA 42 0 

IIIIIIIIIIIIIIMIIIIIIIMMMIIIIIIIIIIMIIIIIIIIIIIIIIIIIII 

Db 366 CTGTTGCTCTACATGGCCTTCCTGATGCTGGTGGACCCTCTGATCCGAAAGCCGGATGCA 42 5 

OV 421 TACACTGAGCAACTGCACAATGAGGAGGAGAATGAGGATGCTCGCTCTATGGCAGCAGCT 4 8 0 

| Illlllllllllllllllllllllllll I II MM II I II II M I M I I II M 

Db 426 TACACTGAGCAACTGCACAATGAGGAGGAGAATGAGGATGCTCGCTCTATGGCAGCAGCT 4 85 

Qy 481 GCTGCATCCCTCGGGGGACCCCGAGCAAACACAGTCCTGGAGCGTGTGGAAGGTGCCCAG 54 0 

| I I I I I I I I I I 1 I I I 1 I I 1 I I I I t I I I I I I 1 I M I I I I 1 I I I I I 1 I I I I I I I 1 I I 1 I I I I 
Db 4 86 GCTGCATCCCTCGGGGGACCCCGAGCAAACACAGTCCTGGAGCGTGTGGAAGGTGCCCAG 54 5 

q v 541 CAGCGGTGGAAGCTGCAGGTGCAGGAGCAGCGGAAGACAGTCTTCGATCGGCACAAGATG 60 0 

1 1 1 1 1 1 [ 1 1 1 1 M 1 1 1 1 1 M I M 1 1 1 1 1 1 M ! 11 M 1 1 1 1 II 1 1 1 1 1 M 1 11 1 1 1 1 1 1 1 1 

Db 54 6 CAGCGGTGGAAGCTGCAGGTGC AGGAGCAGCGGAAGACAGTCTTCGATCGGCACAAGATG 605 

Qy 601 CTCAGCTAGATGGGCTGGTGTGGTTGGGTCAAGGCCCCAACACCATGGCTGCCAGCTTCC 660 

M I I I I I II II I I I i I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 6 06 CTCAGCTAGATGGGCTGGTGTGGTTGGGTCAAGGCCCCAACACCATGGCTGCCAGCTTCC 665 



Qy 


661 


AGGCTGGACAAAGCAGGGGGCTACTTCTCCCTTCCCTCGGTTCCAGTCTTCCCTTTAAAA 
■ ■ ■ i i i i i i i ■ i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i I I I i l 1 l l I I I 


720 


Db 


666 


M 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 M II M M 1 M 1 1 1 II II 1 II I 

AGGCTGGACAAAGCAGGGGGCTACTTCTCCCTTCCCTCGGTTCCAGTCTTCCCTTTAAAA 


725 


Qy 


721 


GCCTGTGGCATTTTTCCTCCTTCTCCCTAACTTTAGAAATGTTGTACTTGGCTATTTTGA 
i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i I I I I I I I I l 1 1 i 1 1 1 1 1 1 111 


780 


Db 


726 


II 1 1 1 1 1| M 1 1 1 II M II II II 11 1 1 1 1 1 M 1 M 1 1 II 1 1 1 1 1 1 1 M II II 1 II II 1 1 1 

GCCTGTGGCATTTTTCCTCCTTCTCCCTAACTTTAGAAATGTTGTACTTGGCTATTTTGA 


785 


Qy 


781 


TTAGGGAAGAGGGATGTGGTCTCTGATCTCTGTTGTCTTCTTGGGTCTTTGGGGTTGAAG 
i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i ! i i i i i i i i i i i i i I 1 1 1 1 1 l I 1 1 1 


840 


Db 


786 


1 M 1 II II II II II 1 II II II II 1 M M 1 1 1 1 1 1 M M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 

TTAGGGAAGAGGGATGTGGTCTCTGATCTCTGTTGTCTTCTTGGGTCTTTGGGGTTGAAG 


845 


Qy 


841 


GGATGGGGAAGGCAGGCCAGAAGGGAATGGAGACATTCGAGGCGGCCTCAGGAGTGGATG 

lii i i i I I i i i I I i I I I i I 1 I I 1 1 1 ! 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 


900 


Db 


846 


Ml | | M || | | | M | | | | || M M 1 M 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 

GGAGGGGGAAGGCAGGCCAGAAGGGAATGGAGACATTCGAGGCGGCCTCAGGAGTGGATG 


905 


Qy 


901 


CGATCTGTCTCTCCTGGCTCCACTCTTGCCGCCTTCCAGCTCTGAGTCTTGGGAATGTTG 
i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i I I l l 1 I 1 1 1 1 


960 


Db 


906 


1 1 1 1 M II 1 1 1 1 1 II 1 1 1 1 II 1 1 1 II II 1 II II M II 1 II II 1 1 II 1 1 1 M 1 1 1 1 II 1 II 

CGATCTGTCTCTCCTGGCTCCACTCTTGCCGCCTTCCAGCTCTGAGTCTTGGGAATGTTG 


965 


Qy 


961 


TTACCCTTGGAAGATAAAGCTGGGTCTTCAGGAACTCAGTGTCTGGGAGGAAAGCATGGC 
i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i 1 1 1 1 1 1 1 1 1 1 1 1 


1020 


Db 


966 


II 1 1 1 1 II 1 II II II II 1 II II 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 N 1 

TTACCCTTGGAAGATAAAGCTGGGTCTTCAGGAACTCAGTGTCTGGGAGGAAAGCATGGC 


1025 


Qy 


1021 


CCAGCATTCAGCATGTGTTCCTTTCTGCAGTGGTTCTTATCACCACCTCCCTCCCAGCCC 
i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 1 i i 1 1 1 | 1 1 1 1 I 1 


1080 


Db 


1026 


|| | | | || I II I II II 1 1 1 1 II 1 II II 1 1 II 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 

CCAGCATTCAGCATGTGTTCCTTTCTGCAGTGGTTCTTATCACCACCTCCCTCCCAGCCC 


1085 


Qy 


1081 


CAGCGCCTCAGCCCCAGCCCCAGCTCCAGCCCTGAGGACAGCTCTGATGGGAGAGCTGGG 

, i i i i i i i i i i i i i i i i i i i i i i i i i i i I I I I l l l 1 1 i 1 1 1 1 1 1 1 1 1 1 1 

| | | || || || | | || | 1 II II 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II II 1 1 1 
CAGCGCCTCAGCCCCAGCCCCAGCTCCAGCCCTGAGGACAGCTCTGATGGGAGAGCTGGG 


1140 


Db 


1086 


1145 


Qy 


1141 


CCCCCTGAGCCCACTGGGTCTTCAGGGTGCACTGGAAGCTGGTGTTCGCTGTCCCCTGTG 

■ i i i i i i i i i i i i > t i i i I 1 I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 


1200 


Db 


1146 


| | || || | | | | | | || || | II 1 II 1 1 1 1 II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 

CCCCCTGAGCCCACTGGGTCTTCAGGGTGCACTGGAAGCTGGTGTTCGCTGTCCCCTGTG 


1205 


Qy 


1201 


CACTTCTCGCACTGGGGCATGGAGTGCCCATGCATACTCTGCTGCCGGTCCCCTCACCTG 

i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i r i i i i i i i i I I I I 
|| || || | || II 1 1 1 II II II 1 1 1 1 1 M M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 N II 

CACTTCTCGCACTGGGGCATGGAGTGCCCATGCATACTCTGCTGCCGGTCCCCTCACCTG 


1260 


Db 


1206 


1265 


Qy 


1261 


CACTTGAGGGGTCTGGGCAGTCCCTCCTCTCCCCAGTGTCCACAGTCACTGAGCCAGACG 
* i i i i i i i i i i i i i i i i i i i i i i i i t i i i I I l l l 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 


1320 


Db 


1266 


M M M M 1 M 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 

CACTTGAGGGGTCTGGGCAGTCCCTCCTCTCCCCAGTGTCCACAGTCACTGAGCCAGACG 


1325 


Qy 


1321 


GTCGGTTGGAACATGAGACTCGAGGCTGAGCGTGGATCTGAACACCACAGCCCCTGTACT 
, i i i , i , , , i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i [ i i | | | | | | 

1 1 M M 1 M 1 M 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 1 II 1 M 

GTCGGTTGGAACATGAGACTCGAGGCTGAGCGTGGATCTGAACACCACAGCCCCTGTACT 


1380 


Db 


1326 


1385 


Qy 


1381 


TGGGTTGCCTCTTGTCCCTGAACTTCGTTGTACCAGTGCATGGAGAGAAAATTTTGTCCT 


1440 


Db 


1386 


MMIIMIIMIIIIIMIMIIIIIIIIIMIIMIIIIIMIIMIIIIIIMIIII 

TGGGTTGCCTCTTGTCCCTGAACTTCGTTGTACCAGTGCATGGAGAGAAAATTTTGTCCT 


1445 


Qy 


1441 


CTTGTCTTAGAGTTGTGTGTAAATCAAGGAAGCCATCATTAAATTGTTTTATTTCTCTC 

1 ! 1 1 1 1 1 1 1 M M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 

CTTGTCTTAGAGTTGTGTGTAAATCAAGGAAGCCATCATTAAATTGTTTTATTTCTCTC 


1499 


Db 


1446 


1504 



